Biological Imaging
◐ Cambridge University Press (CUP)
Preprints posted in the last 90 days, ranked by how well they match Biological Imaging's content profile, based on 15 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.
Miao, Y.; Surguladze, N.; Lerner, J.; Poysungnoen, K.; Ariano, K.; Li, Y.; Zhu, Y.; Van Batavia, K.; Jepson, J.; Van De Klashorst, J.; Ni, B. Y. X.; Armstrong, A.; Rahman, R.; Horstmeyer, R.; Hickey, J. W.
Show abstract
Accurate cell segmentation is an essential step for quantitative analysis of biological imaging data. Recent advances in deep learning have led to the development of generalist segmentation models that perform robustly across multiple imaging modalities, including label-free phase contrast, fluorescence cell culture, and multiplexed fluorescence tissue imaging. However, systematic comparisons of these models at the level of downstream biological analysis remain limited. To address this gap, we evaluated several recent segmentation models, including Cellpose cyto3, Cellpose-SAM, {micro}SAM, and CellSAM, on phase contrast and fluorescence cell culture images. In addition, Mesmer and InstanSeg were included for benchmarking on multiplexed fluorescence tissue images generated using CO-Detection by IndEXing (CODEX). We found that Cellpose-SAM achieved strong performance on phase contrast images, while SAM-based models consistently performed well on fluorescence cell culture data. In contrast, no single model consistently outperformed others on CODEX datasets. Instead, each model exhibited distinct strengths and limitations, which led to differences in downstream analyses, including clustering and cell type identification. Together, our study emphasizes the importance of selecting segmentation models based on dataset characteristics and analytical goals, rather than relying on a single universal approach.
Hogendorn, C.; R. Aragon, I.; Dallon, S.; Batchelor, E.
Show abstract
To properly respond to their environment, cells adjust the activity of key regulatory proteins and rates of gene expression. Methods to detect and quantify these forms of regulatory dynamics in living cells are of central importance for understanding cellular signaling events in both physiological and pathological conditions. Current technologies in this field make use of fluorescent probes to track cell signaling dynamics. Although these technologies have been used for decades, challenges remain. In particular, the segmentation, tracking, and interpretation of single cell dynamic data are time-consuming, prone to subjective errors, and often lacking in standardization across experiments. Here, we present SPIFEE, a data pipeline that uses experiment-dependent parameters to smooth noise and quantify key features of fluorescence data from time-lapse imaging studies. Processing data in this manner enhances and accelerates quantification of live-cell gene and protein expression, simplifies data analysis, and facilitates hypothesis generation. Author SummaryCells adjust protein activity and gene expression levels over time to respond to changes in their environment, a process referred to as cell signaling dynamics. Quantifying cell signaling dynamics in living cells often uses fluorescent probes, such as green fluorescent protein (GFP) and its spectral variants, to track changes in gene expression or protein activity over time. Challenges inherent in analyzing fluorescence data from single cells stem from biological and experimental noise, time-consuming quantification, and subjective errors. To address these challenges, we developed a computational tool called Signal Processing and Integrated Feature Extraction (SPIFEE). The pipeline improves the quality of fluorescence data analysis by reducing noise and extracting signal features in a way that is both intuitive and objective. The pipeline provides more accurate, rapid, and unbiased quantification of time-lapse microscopy data.
Peale, F. V.; Perng, W.; Mbiribindi, B.; Andrews, B. T.; Wang, X.; Dunlap, D.; Eastham, J.; Ngu, H.; Chernyshev, A.; Orlova, D.
Show abstract
The immunohistochemistry (IHC) methods widely used in diagnostic medicine and biomedical research are kinetically complex reaction-diffusion processes that, ideally, produce stain intensities correlated with the local antigen concentration. Yet after 75 years of use, practical theoretical tools to rigorously plan and interpret IHC experiments are still lacking. Because modeling the reactions requires time-consuming computer simulation, impractical for regular use, most protocols are optimized empirically, without detailed knowledge of the reaction rates and antigen-antibody equilibria. The resulting stain intensities can be calibrated against standards with known antigen abundance, but they are typically not interpretable in terms of chemical antigen concentrations. To address these limitations, we developed a fast interpolation method to model reaction-diffusion behavior, and experimental methods to characterize IHC kinetic parameters in formalin-fixed paraffin-embedded (FFPE) samples. Used together, these allow experimental measurement of both the chemical concentration of antigen in the sample and the reaction-diffusion parameters consistent with the assay results. Results show 1) direct immunofluorescent detection has low nanomolar sensitivity with >1000-fold dynamic range, and 2) antibody diffusion rates in FFPE samples can be >1000-fold slower than in aqueous solutions, producing diffusion-limited conditions in which the IHC reaction time course may depend on the sample antigen concentration. Awareness of these details is necessary to avoid potential underestimation of both the absolute and relative antigen concentrations in different samples that may occur if staining is stopped before reaching equilibrium. Software tools are provided to allow users to rapidly model IHC reaction time courses and to fit experimental time course data with candidate reaction parameters. The principles described here apply equally to other tissue-based "spatial omics" analyses and should be considered when designing and interpreting experiments requiring any macromolecule to diffuse into and react in a tissue section. SIGNIFICANCEThe theoretical and experimental framework described here advances IHC staining from a qualitative or semi-quantitative method towards a more rigorously quantitative assay. The practical ability to predict IHC reaction kinetics and fit reaction parameters to experimental data has the potential to advance IHC applications in diagnostic medicine and biomedical research in three ways: 1) interpretation of experimental and diagnostic samples stained under different conditions can be more objective, facilitating comparison of results from different protocols and different laboratories; 2) IHC staining can be interpreted as molar chemical antigen-antibody concentrations calculated from the reaction parameters measured in the studied sample; 3) the correlation between antigen concentration and biological behavior can be examined more reliably. Practical software tools are provided.
Lee, R. M.; Eisenman, L. R.; Hobson, C.; Aaron, J. S.; Chew, T.-L.
Show abstract
Motion is an essential component of any living system. It is rich with information, but it is often challenging to quantitatively extract biologically informative results from the motion apparent in microscopy images. This challenge is exacerbated by the wide variety in biological movement, which often takes the form of difficult-to-segment amorphous structures undergoing complex motion. An image processing technique known as optical flow can capture motion at each pixel in an image, thus bypassing the need for object segmentation or a priori definition of motion types. This makes it a powerful tool for quantitative assessment of biological systems from the protein to organism scale. However, despite its flexibility and strengths for analyzing fluorescence microscopy images, its adoption in the bioimaging community has been limited by the availability of easy-to-use tools and guidance in results interpretation. Here we describe an optical flow tool, OpticalFlow3D, that can be run in Python or MATLAB and is compatible with three-dimensional microscopy images. Using biological examples across length scales, we illustrate how OpticalFlow3D can enable new biological insight.
Van De Vijver, E.; Dewitte, K.; Van Alboom, A.; Christophe, A.; Van Vlierberghe, H.; Van Troys, M.
Show abstract
Three-dimensional microtumour models such as spheroids are increasingly used in cancer research as they better capture tumour architecture, growth and invasion than conventional two-dimensional cultures. However, robust and accessible tools for quantitative analysis remain limited. Here we present SImBA-SiQuAl, an integrated open-source workflow for high-throughput quantitative phenotyping of 3D spheroids and organoids. The pipeline combines SImBA, an automated image-analysis framework for performant quality-controlled image segmentation and multi-feature extraction from spheroid assays, with SiQuAl, a downstream analysis platform that automatically performs comprehensive statistical and multivariate analyses to reveal phenotypic differences between experimental conditions. In a first case study, SImBA-SiQuAl resolves intrinsic invasion phenotypes between cancer cell lines. In a second case study, the workflow quantifies both uniform and heterogeneous responses in a spheroid drug screening assay. Together, SImBA-SiQuAl provides a new, timely tool for high-throughput, high-content microtumour phenomics in cancer research. MOTIVATION3D-microtumour assays such as spheroids and organoids are increasingly used in preclinical research. These assays generate rich phenotypic imaging data, but quantitative automated analysis remains a major bottleneck. This limits reproducibility, scalability, and broad adoption for large-scale, high-content phenomics studies, but also implies biologically relevant phenotypic (heterogeneous) responses in e.g. perturbation studies may not be comprehensively addressed. SImBA-SiQuAl is developed to address this gap by providing an open-source, integrated workflow offering solutions in both the image processing and downstream analysis. Together, this enables in-depth quantitative analysis of 3D microtumour phenotypes across experimental settings. HIGHLIGHTSO_LISImBA-SiQuAl provides a complete end-to-end workflow for high-throughput, high-content, quantitative 3D microtumour analysis, from quality-controlled image segmentation to statistical, multivariate and cluster-based biological interpretation. C_LIO_LISImBA-SiQuAl is broadly applicable across multiple 3D systems and assay types. C_LIO_LIWe demonstrate the workflow can capture biologically meaningful heterogeneity and treatment response at scale, supporting robust and unbiased analysis. C_LIO_LIBy combining accessibility, flexibility and analytical depth, SImBA-SiQuAl addresses a key unmet need for accessible advanced open-source tools in 3D preclinical research. C_LI
Fan, B.; Bilodeau, A.; Beaupre, F.; Wiesner, T.; Gagne, C.; Lavoie-Cardinal, F.; Hlozek, R.
Show abstract
SignificanceFluorescence-based Ca2+-imaging is a powerful tool for studying localized neuronal activity, including miniature Synaptic Calcium Transients, providing real-time insights into synaptic activity. These transients induce only subtle changes in the fluorescence signal, often barely above baseline, which poses a significant challenge for automated synaptic transient detection and segmentation. AimDetecting astronomical transients similarly requires efficient algorithms that will remain robust over a large field of view with varying noise properties. We leverage techniques used in astronomical transient detection for miniature Synaptic Calcium Transient detection in fluorescence microscopy. ApproachWe present Astro-BEATS, an automatic miniature Synaptic Calcium Transient segmentation algorithm that incorporates image estimation and source-finding techniques used in astronomy and designed for Ca2+-imaging videos. Astro-BEATS uses the Rolling Hough Transform filament detector to construct an estimate of the expected (transient-free) fluorescence signal of both the dendritic foreground and the background. Subtracting this baseline signal yields difference images displaying transient signals. We use Density-Based Spatial Clustering of Applications with Noise to find sources clustered in spatial and temporal space. ResultsAstro-BEATS outperforms current threshold-based approaches for synaptic Ca2+ transient detection and segmentation. The produced segmentation masks can be used to train a supervised deep learning algorithm for improved synaptic Ca2+ transient detection in Ca2+-imaging data. The speed of Astro-BEATS and its applicability to previously unseen datasets without re-optimization makes it particularly useful for generating training datasets for deep learning-based approaches. ConclusionAstro-BEATS greatly reduces the time needed for the annotation of synaptic Ca2+ transient and removes the significant overhead of human expert annotation, enabling consistent analysis of new Ca2+-imaging datasets.
Bright, M.; Mi, X.; Duarte, D.; Carey, E.; Lyu, B.; Wang, Y.; Nimmerjahn, A.; Yu, G.
Show abstract
BackgroundAdvanced biological imaging analysis platforms such as Activity Quantification and Analysis (AQuA2) enable accurate spatiotemporal activity analysis across diverse cell populations within many species. These tools are increasingly important for investigating cellular signaling dynamics and behavior. However, despite advances in the accuracy and species capability of AQuA2, it remains computationally demanding for analysis of long time-series datasets and requires all users to maintain a MATLAB license, which may limit accessibility and large-scale deployment. ResultsTo address these limitations, we have designed and made available AQuA2-Cloud, a portable software stack and web platform developed as an improvement and further evolution of AQuA2. This container-deployable system permits multi-user cloud-based high accuracy activity quantification with intuitive workflows, export of analysis data and project files, and comparable processing times. The platform offers integrated features such as in-browser analysis control interfaces, asynchronous program state control, multiple users and user management, support for unreliable connections, file uploading and downloading via web browsers and File Transfer Protocol, and centralized organization of analysis output. ConclusionAQuA2-Cloud constitutes a cloud-native solution for laboratories or research groups seeking to centralize analysis of spatiotemporal biological imaging datasets while reducing software installation and licensing barriers for end users. The platform enables researchers with minimal technical expertise to perform advanced bioimaging analysis through standard web browsers while maintaining the analytical capabilities of AQuA2. AQuA2-Cloud source code, deployment procedures, and documentation are freely available at (https://github.com/yu-lab-vt/AQuA2-Cloud).
Stenberg, J.; Gullapalli, A.; Foucar, K.; Babu, D.; Redemann, J.; Joste, N.; Foucar, C.; Gratzinger, D.; George, T.; Ohgami, R.; Gullapalli, R. R.
Show abstract
Digital Pathology (DP) is a fast-emerging branch of pathology focused on digitizing pathology data. A key challenge of DP usage for pathology laboratories, especially mid- to small-sized clinical labs, are the upfront costs associated with instrumentation and the logistical challenges of implementation. In the current project, we built an end-to-end DP solution using low-cost, open-source components that is user-friendly at a small scale. We repurposed readily available microscopy components in a pathology lab to assemble a fully functional DP pipeline for translational research applications. We tested multiple low-cost complementary metal-oxide semiconductor (CMOS) cameras in this project and chose a user-friendly Canon camera for image acquisition. An open-source DP server solution, OMERO v.5.6.4, was used as the image management system (IMS) to host and serve the WSIs on an Ubuntu 22.04 operating system. The server-hosted WSI images were evaluated remotely and asynchronously by multiple pathologists physically situated in Albuquerque, NM; Salt Lake City, UT; and Palo Alto, CA. Each pathologist assessed the quality of the WSI pipeline, image quality, and WSI interaction experience using a 23-question survey. Overall, the custom, low-cost WSI pipeline was noted to be a robust and user-friendly experience by the pathologists. The current DP setup is unlikely to be useful as a commercial, scalable DP pipeline for large-scale clinical applications. However, it demonstrates the feasibility of creating customized, small-scale DP solutions (at a low price point) for asynchronous translational pathology research applications. Additionally, building customized DP pipelines provides excellent educational opportunities for pathology residents to gain in-depth knowledge of the various technical elements of a DP workflow. In summary, we have established a low-cost, end-to-end WSI DP pipeline useful for spatiotemporally asynchronous translational pathology research, in an academic setting.
Bhattiprolu, S.
Show abstract
1Three-dimensional organoid cultures have emerged as powerful models for studying human tissue biology, disease mechanisms, and drug responses. Fluorescence confocal microscopy of organoids generates complex volumetric image data that is increasingly analyzed using deep learning pipelines for cell segmentation, morphometry, and phenotyping. However, training and benchmarking such pipelines requires large annotated datasets, the manual curation of which is prohibitively expensive and time-consuming. Here we present a parametric, physics-based computational framework for generating synthetic 3D fluorescence organoid images with exact ground-truth cell body and nucleus label masks. The framework models cell placement using force-directed sphere packing with optional hollow lumen exclusion for cyst-forming organoids, cell morphology using power-diagram (Laguerre) tessellation with apical-basal elongation and surface flattening for polarized epithelial cells, membrane curvature using low-frequency coordinate displacement, nuclear shape using irregular ellipsoid deformation with smooth radial eccentricity direction blending, and optical effects using depth-dependent point-spread function broadening, a physically motivated staining diffusion gradient with residual interior plateau, z-attenuation, haze, shot noise, and channel crosstalk. The necrotic core model uses a three-phenotype nuclear population, pyknotic, ghost, and karyorrhectic, reflecting the histological diversity of real necrotic zones. Five condition-specific presets are provided, calibrated to published biological measurements and covering PDAC osmotic stress, HMECyst normal and cyst-forming organoids, and a large primary PDAC organoid with a necrotic core. Unlike generative adversarial network approaches, our method requires no training data, produces exact ground truth by construction, and allows systematic and interpretable control over every morphological and optical parameter. The framework is released as open-source Python software with a PyQt5 graphical interface and produces OME-TIFF output compatible with arivis Pro, FIJI, and napari, as well as most other microscopy image analysis software.
Brito Pacheco, D.; Giannopoulos, P.; Reyes-Aldasoro, C. C.
Show abstract
This paper investigates the way in which mitochondria distribute and align inside HeLa cells observed with serial block-face scanning electron microscopy. Four models of alignment were considered: (1) mitochondria exhibiting no discernible alignment pattern, (2) mitochondria aligned pointing towards the nucleus of the cell, (3) mitochondria aligned all in one direction when viewed from above, (4) mitochondria aligned tangent to the surface of the nucleus. These models were named (1) unaligned, (2) petals, (3) racecars, and (4) clouds. The mitochondria, nucleus and plasma membrane of 25 individual cells were segmented. A total of 12,299 mitochondria were identified and analysed. Alignment of the major axis of each mitochondrion was calculated in two ways: relative to a ray that joins it to the centroid of the nucleus, and relative to a ray that joins it to the nucleus surface. Results indicate that mitochondria tend to align tangentially to the nucleus surface, i.e., a clouds model. In addition, differences in the spatial distributions of the mitochondria were found and quantified with clearly defined metrics. The methodology here presented can be extended to other acquisition settings where the distribution and alignment of cells could be important, for instance, histopathology.
Shtengel, D.; Shtengel, G.; Xu, C. S.; Hess, H. F.
Show abstract
Electron Microscopy (EM) is widely used in many scientific fields, particularly in life sciences, offering high-resolution information on the ultrastructure of biological organisms. Accurate characterization of EM image quality is important for assessing the EM tool performance, in addition to sample preparation protocol, imaging conditions, etc. This paper provides an overview of tools we developed as plugins for the popular image processing package Fiji (ImageJ) (1). These tools include signal-to-noise ratio analysis, contrast evaluation, and resolution analysis, as well as the capability to import images acquired on custom FIB-SEM instruments (2). We have also made these tools available in Python, with both versions available on GitHub.
Letort, G.; Valon, L.; Michaut, A.; Cumming, T.; Xenard, L.; Phan, M.-S.; Dray, N.; Rueden, C. T.; Schweisguth, F.; Gros, J.; Bally-Cuif, L.; Tinevez, J.-Y.; Levayer, R.
Show abstract
Investigating single-cell dynamics and morphology in tissues and embryos requires highly accurate quantitative analysis of microscopy images. Despite significant advances in the field of bioimage analysis, even the most sophisticated segmentation and tracking algorithms inevitably produce errors (e.g. : over segmentation, missing objects, miss-connected objects). Although error rate may be small, their propagation throughout a time-lapse sequence has catastrophic effects on the accuracy of tracking and extraction of single cell parameters. Extracting single cell temporal information in the context of tissue/embryo requires thus expert curation to identify and correct segmentation errors. In the movies commonly used in developmental biology and stem cell research, both the number of imaged cells and the duration of recording are large, making this manual correction task extremely time-consuming. This has now become a major bottleneck in the fields of development, stem cell biology and bioimage analysis. We present here EpiCure (Epithelial Curation), a versatile tool designed to streamline and accelerate manual curation of segmentation and tracking in 2D movies of large epithelial tissues. EpiCure uses temporal information and morphometric parameters to automatically identify segmentation and tracking errors and provides user-friendly tools to correct them. It focuses on ergonomics and offers several visualization options to help navigating in movies of tissue covering a large number of cells, speeding up the detection of errors and their curation. EpiCure is highly interoperable and supports input from a wide range of segmentation tools. It also includes multiple export filters, enabling seamless integration with downstream analysis pipelines. In this paper, using movies from several animal models, we highlight the importance of curating cell segmentation and tracking for accurate downstream analysis, and demonstrate how EpiCure helps the curation process for extracting accurate single cell dynamics and cellular events detection, making it faster and amenable on large dataset.
Le, T. X.; Tran, L.-A. T.; Farabi, D. A.; Wang, S.; Phan, A. T. Q.; Cormier, S. A.; Taada, A.; McGrew, D.; Du, Y.; Vu, L. D.
Show abstract
Automated analysis of murine bronchoalveolar lavage fluid (BALF) cytology is important for preclinical respiratory research, yet progress has been limited by the lack of publicly available, well-annotated mouse BALF image datasets. We present MurineCyto-Det, a high-resolution murine BALF cytology dataset comprising 333 image tiles of size 1024x1024 pixels, annotated across five cytological categories with both pixel-level segmentation masks and one-to-one matched bounding boxes. The dataset contains 14,551 annotated cell instances and supports two complementary analysis tasks: morphology-oriented cell segmentation and object-level cell detection. To establish reproducible benchmark baselines, we evaluated representative segmentation and detection models. The results demonstrate the practical utility of MurineCyto-Det while highlighting realistic challenges arising from class imbalance, small object size, irregular cell morphology, and ambiguous debris-like structures. MurineCyto-Det provides a standardized resource for developing, evaluating, and comparing automated methods for murine BALF cytology analysis. The dataset is publicly available at https://doi.org/10.5281/zenodo.17608677.
Pohar, C.; Rekik, Y.; Phan, M. S.; Gallet, B.; Desroches-Castane, A.; Chevallet, M.; Tinevez, J.-Y.; Tillet, E.; Vigano, N.; Jouneau, P.-H.; Deniaud, A.
Show abstract
The liver has a complex architecture composed of millions of lobules. Within these lobules, hepatocytes, the main hepatic cells, are organized in rows separated by blood capillaries known as sinusoids. These capillaries are lined by liver sinusoidal endothelial cells (LSEC) that form a very specific fenestrated endothelium essential for the exchange of metabolites and proteins between the blood and hepatocytes. Alterations in the size and number of LSEC fenestrations are associated with the onset and the progression of various liver diseases. The analysis of liver architecture is thus of utmost importance for advancing our knowledge of liver ultrastructure and its alterations. Liver architecture has been studied since decades, mainly using 2D electron microscopy, and more recently using advanced super-resolution fluorescence microscopy. In recent years, volume electron microscopy techniques, including focused ion beam-scanning electron microscopy (FIB-SEM) progressed and nowadays enable the 3D reconstruction of biological ultrastructures down to nanometer resolution. However, the analysis of large volumes (e.g., several tens of {micro}m3) remains challenging due to various constraints in the segmentation of large datasets. In the current study, we developed a workflow to semi-automatically segment hepatic sinusoids from FIB-SEM mice liver datasets using the CNN-based (convolutional neural network) tool known as "nnU-Net", after fine-tuning a ground truth model. We also implemented tools for semi-automatic quantification of LSEC fenestrae diameters and sinusoid porosity from segmented datasets. This workflow enabled us to compare the distribution of LSEC fenestrae diameters in wild-type versus Bmp9-deleted mice, a hepatic factor known to be involved in fenestration maintenance. Our results confirm the importance of BMP9 for LSEC differentiation. Therefore, the developed methodology represents a valuable tool for characterizing the fenestrated endothelium under various physiological and pathological conditions.
O'Roberts, E.; Panshikar, P. R.; Li-Wang, X.; Avenel, C.; Verron, Q.; Coulier, E.; Bienko, M.; Stadler, C.
Show abstract
Different omics types such as genomics and proteomics all contribute to deciphering biology. Applying these omics approaches in a spatial context helps reveal biology in situ at a single cell level. Here we present a protocol for the combined multiplexed detection of targeted genes using DNA FISH, and proteins using multiplexed immunofluorescence. The protocol is integrated on the commercial PhenoCycler platform and generates one single dataset with gene and protein readout at a single cell level in large tissue sections, allowing for a throughput of thousands to millions of cells. The workflow can be used for characterising malignant cells in large tumor areas based on genetic aberrations, while deciphering the cellular landscape and microenvironment from multiplexed protein detection using immunofluorescence.
Schneider, F.; Trinh, L. A.; Fraser, S. E.
Show abstract
Fluorescent reporters such as fluorescent proteins or chemigenetic indicators are indispensable tools for studying biological processes using light microscopy. Choosing an appropriate fluorescent tag is a crucial step in experimental design not only for imaging but also for quantitative measurements such as fluorescence fluctuation spectroscopy. Two key parameters should be considered: Fluorescent brightness and photo-bleaching. Change to fluorescence intensity due to photobleaching is relatively easy to assess in different biological environments, while brightness is more elusive. Here, we develop and employ a fluorescence correlation spectroscopy (FCS) based excitation scan assay that determines fluorescent protein performance and validate it in tissue culture and zebrafish embryos. We employ our FCS pipeline to compare a set of 10 established fluorescent proteins as well as HALO and SNAP tags for both cellular imaging and measurements of diffusion dynamics with FCS. We show that mNeonGreen outperforms mEGFP in tissue culture and zebrafish embryos. We also compare StayGold variants against other green fluorescent proteins and chemigenetic reporters in tissue culture. Overall, we present a broadly applicable approach for determining fluorescent reporter brightness in the living system of interest.
Wright, G.; Keller, P.; Muter, J.; Brosens, J.; Tejpar, S.; Minhas, F.
Show abstract
Histological images are composed of diverse local structures such as cells, glands, or tissue patches, whose organisation and relative abundance reflect underlying biological processes. While most computational approaches focus on analysing individual features, many clinically relevant signals arise from changes in the composition of these structures rather than isolated measurements. This motivates the need for explicitly modelling how groups of similar instances vary across samples and relate to outcomes. We present DCAFA (Differential Community Abundance and Feature Attribution Analysis), a regression-based framework for analysing hierarchical biomedical data through both compositional and feature-level perspectives. DCAFA groups instances into latent communities representing recurring morphological or phenotypic patterns, and then performs two complementary analyses: (i) community composition analysis, which identifies groups that are enriched or depleted across outcomes, and (ii) feature attribution analysis, which quantifies how instance-level features relate to outcomes directly or within specific communities. Both use generalised linear and mixed-effects models, enabling covariate adjustment and inference through effect sizes, confidence intervals, and false discovery rate control. We demonstrate the utility of DCAFA across multiple biomedical settings, including endometrial histopathology, spatial transcriptomics, multiplex immunofluorescence imaging, and predefined cell-type analyses in colorectal cancer. These examples identify interpretable compositional shifts and context-specific feature associations that are not captured by conventional feature-based approaches. By unifying differential abundance testing and feature attribution within a single statistical framework, DCAFA serves as an openly available toolbox that provides a practical and interpretable means of linking tissue composition with clinical or molecular outcomes in biomedical imaging data. Code available at: https://github.com/wgrgwrght/DCAFA
Wilsenach, J. B.; Fonseca, S.; Ahnert, S. E.; Wojtowicz, E. E.
Show abstract
BackgroundImaging flow cytometry (IFC) provides a high quantity of single-cell morphological data, yet the field lacks open access tools for designing interpretable, bespoke parameters. In particular, rare and atypical cell populations where well annotated data is limited, are negatively affected. ResultsWe present Flow cytometry Feature Importance (FlowFI), an open-source graphical software for bespoke image parameter design and analysis. FlowFI provides a suite of image parameter options combining data across multiple channels and markers, tailored digital noise reduction (reducing noise resulting from common flow cytometry ultra-high image acquisition modalities), and a scalable, unsupervised feature selection pipeline that allows experimentalists to refine image-derived parameters iteratively, with a novel ensemble subsampling approach that provides robust feature importance scoring. We validated FlowFI using data from a rare and heterogenous bone marrow cell type, megakaryocytes, demonstrating that the tool can successfully identify novel, discriminatory morphological features to improve the purity of selected cell populations and gating strategy. ConclusionFlowFIs core functionalities are interacted with through an intuitive user interface for researchers with options to export data directly to common image and flow cytometry software formats. With this in mind, FlowFI offers a scalable way to both feature design, and feature refinement using a range of approaches to manifold learning, augmented by a data efficient bootstrap subsampling approach for unsupervised parameter recommendations in the big data regime. The software also introduces a new feature selection measures based on common manifold learning methods in the space inspired by the Uniform Manifold Approximation and Projection (UMAP) algorithm and finds performance comparable to existing methods. FlowFI provides a versatile testing ground for future developments in broad and dynamically developing areas of research including single cell analysis, label-free sorting and intra- and inter-cellular interaction analysis, while ensuring interoperability with current research workflows. Desktop installation options as well as detailed documentation can be found at https://github.com/EarlhamInst/FlowFI
Lüthi, J.; Cerrone, L.; Comparin, T.; Hess, M.; Hornbachner, R.; Tschan, A.; Glasner de Medeiros, G. Q.; Repina, N. A.; Cantoni, L. K.; Steffen, F. D.; Bourquin, J.-P.; Liberali, P.; Pelkmans, L.; Uhlmann, V.
Show abstract
The rapid growth in microscopy data volume, dimensionality, and diversity urgently calls for scalable and reproducible analysis frameworks. While efforts on the open OME-Zarr format have helped standardize the storage of large microscopy datasets, solutions for standardized processing are still lacking. Here, we introduce two complementary contributions to address this gap: 1) the Fractal task specification, defining OME-Zarr processing units that can interoperate across computational environments and workflow engines, and 2) the Fractal platform, using this specification to enable scalable and modular OME-Zarr-native analysis workflows. We demonstrate their use across diverse biological research data, including terabyte-scale multiplexed, volumetric, and time-lapse imaging. In a clinical setting, we show that Fractal workflows achieve near-identical quantification of millions of cells across independent deployments, demonstrating the reproducibility required for translational applications. With its growing community of contributors, the Fractal ecosystem provides a foundation for FAIR microscopy image analysis relying on open file formats.
Ganz, M.; Norgaard, M.; Pernet, C.; Matheson, G. J.; Galassi, A.; Ceballos, E. G.; Wighton, P.; Bilgel, M.; Eierud, C.; Gonzalez-Escamilla, G.; Buckholtz, J.; Blair, R.; Markiewicz, C. J.; Hardcastle, N.; Greve, D. N.; Thomas, A. G.; Poldrack, R. A.; Calhoun, V. D.; Innis, R. B.; Knudsen, G. M.
Show abstract
Molecular neuroimaging with positron emission tomography (PET) and single-photon emission computed tomography (SPECT) enables quantification of specific molecular targets in the living brain. Despite its scientific impact, molecular neuroimaging research has historically faced challenges due to high costs, small sample sizes, laboratory-specific analysis pipelines, and limited large-scale data sharing. These factors have hindered reproducibility and the broader reuse of valuable PET datasets. The OpenNeuroPET initiative was established to address these barriers by developing standards, infrastructure, and open-source tools for organizing, sharing, and analyzing molecular neuroimaging data. Through collaborations across Europe and North America, OpenNeuroPET has supported the PET extension of the Brain Imaging Data Structure (PET-BIDS), providing a standardized framework for PET datasets and metadata. Building on PET-BIDS, tools such as PET2BIDS, ezBIDS, and BIDSCoin facilitate data conversion and curation. In parallel, OpenNeuro now hosts PET-BIDS datasets for open sharing, while complementary platforms such as PublicnEUro enable GDPR-compliant controlled access. Emerging open-source workflows and BIDS applications further support automated, reproducible PET preprocessing and quantitative analysis, promoting harmonized processing across centers. Together, these developments mark an important step toward an open molecular neuroimaging ecosystem in which datasets, software, and workflows can be transparently shared, reused, and scaled for collaborative research.